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Abstract 

We know that the marginals in a Dirichlet distribution are beta 
variates exhibiting a negative correlation. But we can construct two 
linear combinations of such marginals in such a way to obtain a positive 
correlation. We discuss the restrictions that are to be imposed on 
the parameters to accomplish such a result. In the case the sampling 
from the Dirichlet distribution is performed through a generalization of 
Johnk's method we discuss the efficiency of the algorithm implementing 
the method. 

1 Introduction 

Let {Xi, X2, X^} be distributed according to a Dirichlet distribution (see, 
for example, |2|, pages 231-235]), which will be denoted by {Xi, X2, V3} ~ 
2? (3, oq, ai, a2, 03). To fix notation in this case 



^X/i — o ^ % — 1 , 2 ^ 3 



where Zj, j = 0, . . . , 3 are independent gamma variables: Zj ~ ^(A, qj). 
Define 

Xi + X3, 

X2 + ^3- 



It follows 



Yi ~ Be{ai + as, ao + a2) 
Y2 ~ Be{a2 + as, ao + ai) 
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where Be{-, •) denotes a beta distribution. Set 7 = ao + + ck2 + "3- Then 
we have 



Cov{Xi, X2) = - 
Cov{Xi, X3) = - 
CoviX2, X3) = - 



0:102 
7^(7 + 1) 

7^(7 + 1) 
02 "3 

7^(7 + !)■ 



Since 
we have 

Then 



"3(7 - "3) 



VariXs) 



Cov{Yi, Y2) 



72(7 + 1) • 

-aia2 — aios — 0203 + 0:3(7 — as) 

7^(7 + 1) 
-0:10:2 + 0:00:3 



72(7 + 1) • 

It follows that the correlation coefficient is given by 

—0:10:2 + 0:00:3 



p{Yi, Y2) = 



^/{al + o:3)(o:o + o:2)(a2 + o:3)(o:o + oi) ' 



and the correlation is positive if 000:3 > aia2. 

Suppose now that we want to sample from a bivariate density with beta 
marginals, with parameters, respectively, ci, C2 and C3, C4, and a given pos- 
itive correlation coefficient r. To fit into the previous framework we set 

= ai + 03, 
= 00 + 0:2, 
= 0:2 + 0:3. 



Then 

which implies 



C4 = ao + cti = ci + C2 — C3, 

Ci + C2 > C3. 
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Furthermore 

— 0102 + O0O3 



v/ciC2C3(ci + C2 - C3) 

We assume r > 0. We solve for {ao, oi, 02, 03}, as functions of {ci, C2, C3, r}. 
We get 

r^CiC2C3(ci + C2 - C3) + C1C3 

03 = . 1) 

Cl + C2 



It follows 03 > 0, as required. And 



= Cl- 03, 

= C3 - 03, (2) 
= C2 - C3 + 03. 

All the parameters have to be positive, so we must have 

Cl - 03 > 0, 

02-03 > 0, (3) 

^ C2 - C3 + 03 > 0. 

To determine in the general case the conditions upon which these restrictions 
are satisfied is rather cumbersome. We analyze in details some particular 
cases. 

I Case: ci = C3. In this case the two marginal have the same distribution 
Be{ci, C2). 
We have 



rJcfC2{ci + C2 - Cl) + cf 

03 = ■ 

Cl + C2 

_ ci(ci + rc2) 

Cl + C2 

We see that the first two condition in Equation |^ are identical and 

c? + C1C2 — rciC2 — c? 
Cl - 03 = ■ 

Cl + C2 

_ ciC2(l - r) 

Cl + C2 

> 0. 
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As for the third condition we have 

c| + rciC2 

C2 - C3 + 03 = ■ 

Cl + C2 

> 0. 

We can conclude that in this case we do not have any restrictions on the 
parameters. 

II Case: 02 = 03. 

In this case ci + C2 — C3 > 0. We have 

rciC2 + C1C2 
"3 = ; • 

Cl + C2 

1. 

ci-a3 = ci(ci - rc2). 
To be greater than zero it requires 

Cl > rc2- 

2. 

C2(c2 - rci) 

C3 - a3 = C2 - Q!3 ^ 



Cl + C2 

To be greater than zero it requires 

C2 > rci. 

The two conditions are then satisfied if 



rc2 < Cl < — . (4) 
r 

3. 

C2 — C3 + a3 = 0:3 > 0. 

Ill Case: ci = C2. 

In this case ci + C2 — C3 = 2ci — C3, so we must have S = 2ci — C3 > 0, which 
imphes ^ > ^■ 
We have 

rcis/c^ + C1C3 

Ct3 = . 

2ci 
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1. 

S — i'\/c^ 
ci - as = ^ — — • 

To be greater than zero it requires 



S > r^ycsS, 

that is (5 > r^C3. That means 

, C3(l + r^) 
ci> 2 • 

Because 

C3(l + r2) 
C3> ^ , 

if we have ci > cs the above condition is always satisfied. 

2. 

C2 - as = Cl - Q!3, 

same as before. 

3. 

Cs - 

Cs - as = 2 • 

To be greater than zero we must have 



Cs > r\/cs6, 

that is 

Cs > r^S, 

which leads to 

C3(l + r2) 

Putting the two conditions together we have 

cs(l + r^) ^ ^ cs(l + r^) 
2 < < 2r2 ' 

which can be rewritten as 

1 + ci 1 + 
— z — < — < 



cs 2r2 
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C2 - C3 + Q!3 = Cl - C3 + as. 

ci [2ci - cs + rVc^] 

Cl - cs + as = — ^ i 

2ci 

6 + r-v/c3(5 
2 

> 0. 

IV Case: ci = C2 = C3 
Then 



1. 



2. 
3. 



2 Efficiency with Johnk's method 

If in sampling from the Dirichlet distribution we use a generahzation of 
Johnk's method (see, for example, [|^, pages 136-137]), that is 

1. generation of 4 independent uniform variates, Ui, i = 0, . . . , 3. 

1 

2. evaluation of Zi = U"' , i = 0, . . . , 3, where {ui} are given by Equa- 
tion [l|, Equation ^, under restrictions as in Equation ^. 

3. conditioning upon S = J2i=o — 1) the variables Xj = i = 1, 2, 3 
possess the required distribution P(3, ao, ai, 02, 03)) 

then the efficiency, that is PIS' < 1}, is given by 

(eLo«0^(eLo«0' 

To give an idea we present the following data, referred to the case ci =03. 
Table 1 refers to the case where r = 0.5, Table 2 when r = 0.75. 



4. 

We get 



as 



ci(l +r) 



Cl - 03 



2 

ci(l - r) 



> 0. 



C3 - "3 = Cl - 03 > 0. 



C2 — C3 + Q!3 = 03 > 0. 
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Table 1. Efficiency with Johnk's method 



for selected values of ci = C3, C2 and r = 0.50 



Cl 


C2 


0.23 


0.25 


0.5 


0.75 


1.00 


2.00 


3.00 


5.00 


0.25 


0.907 


0.897 


0.821 


0.763 


0.716 


0.596 


0.526 


0.444 


0.50 


0.833 


0.822 


0.701 


0.612 


0.544 


0.382 


0.298 


0.213 


0.75 


0.778 


0.763 


0.612 


0.506 


0.428 


0.257 


0.179 


0.108 


1.00 


0.734 


0.717 


0.544 


0.428 


0.345 


0.180 


0.113 


0.058 


1.50 


0.668 


0.647 


0.447 


0.323 


0.243 


0.098 


0.050 


0.019 


2.00 


0.619 


0.596 


0.381 


0.257 


0.180 


0.058 


0.024 


0.007 


3.00 


0.552 


0.526 


0.299 


0.179 


0.113 


0.020 


0.007 


0.001 


5.00 


0.472 


0.444 


0.213 


0.108 


0.058 


0.007 


0.001 


0.000 



Table 2. Efficiency with Johnk's method 



for selected values of ci = C3, C2 and r = 0.75 



Cl 


C2 


0.23 


0.25 


0.5 


0.75 


1.00 


2.00 


3.00 


5.00 


0.25 


0.916 


0.909 


0.843 


0.793 


0.752 


0.646 


0.583 


0.507 


0.50 


0.854 


0.843 


0.734 


0.654 


0.592 


0.441 


0.360 


0.273 


0.75 


0.806 


0.793 


0.654 


0.558 


0.482 


0.315 


0.234 


0.155 


1.00 


0.768 


0.752 


0.592 


0.482 


0.403 


0.232 


0.158 


0.090 


1.50 


0.710 


0.691 


0.503 


0.381 


0.298 


0.138 


0.079 


0.035 


2.00 


0.668 


0.646 


0.441 


0.315 


0.233 


0.088 


0.043 


0.015 


3.00 


0.607 


0.583 


0.361 


0.239 


0.157 


0.043 


0.016 


0.004 


5.00 


0.534 


0.507 


0.273 


0.155 


0.091 


0.015 


0.004 


0.000 
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